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ABSTRACT 

We study the impact of theoretical uncertainty in the dark matter halo mass function and halo bias on dark 
energy constraints from imminent galaxy cluster surveys. We find that for an optical cluster survey like the 
Dark Energy Survey, the accuracy required on the predicted halo mass function to make it an insignificant 
source of error on dark energy parameters is ps 1%. The analogous requirement on the predicted halo bias is 
less stringent (ps 5%), particularly if the observable-mass distribution can be well constrained by other means. 
These requirements depend upon survey area but are relatively insensitive to survey depth. The most stringent 
requirements are likely to come from a survey over a significant fraction of the sky that aims to observe clusters 
down to relatively low mass, M t h ~ 10 h~ M®\ for such a survey, the mass function and halo bias must be 
predicted to accuracies of ps 0.5% and respectively. These accuracies represent a limit on the practical 

need to calibrate ever more accurate halo mass and bias functions. We find that improving predictions for 
the mass function in the low-redshift and low-mass regimes is the most effective way to improve dark energy 
constraints. 

Subject headings: cosmological parameters — cosmology: theory — galaxies: clusters — galaxies: halos 



1. INTRODUCTION 

Observations of the number density of galaxy clusters as 
a function of cluster mass and redshift are powerful probes 
of cosmology, especially the accelerated cosmological expan- 
sion caused by the cryptically dubbed dark energy. Cosmo- 
logical parameters have been recently inferred from a num- 
ber of observations employing different techniques for cluster 



identification (e.g ., Gladders et al. 2007; Mantzetal 



Henrvetal 



Mantzetal 



2009; IRozo et al.l 1201 ft iViMilininetai 



2008 



2009; 



l2009bl) . The effort to derive precise and unbiased 



constraints on cosmological parameters, particularly those de- 
scribing dark energy, requires control of various systematic 
error sources. In this paper, we estimate the precision with 
which the abundance of dark matter halos as a function of 
mass (the halo mass function) and the clustering of halos as a 
function of mass (the halo bias) must be predicted in order to 
ensure that errors in the predictions for these quantities will 
be a small fraction of the error budget in forthcoming cluster 
count cosmology efforts. 

For the current generation of cluster count surveys, such 
as the Sloan Digital Sky Survey (SDSS), the Red-Sequence 
Cluster Survey (RCS), and the Massive Cluster Survey 
(MACS), uncertainty in the mass function is unlikely a ma- 
jor concern. The dominant errors in contempo rary surveys 
are th ought to be either limited statistics (e.g., iMantz et all 
120081 for MACS) or the uncertain rel ation between observ- 
able quantities a nd cluster mass (e.g. JGladders et al.l 120071: 
IRozo et alJl20T0l for RCS and SDSS respectively). Uncer- 
tainty in the relation between clust er observables and masses 
will b e an important lim itation (e.g.. iMaiumdar & Mohrl2003l 
200 lpma"& Hul2004l2005l:IStanek et alJ2006tlLima & Hul 
20071: [Cunha 2009), and it is thought that some combination 
of empirical and theoretical insight into the observable-mass 
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relation will continue to pr ove effective in the interpretation 
of fo rthcoming data (e.g., iRravtsov et alJ 120061 : IRozo et al] 
120091) . Understanding the relation between observables and 
mass is a rapidly evolving subject, so it is difficult to antici- 
pate the level at which this will be controlled in the analyses 
of forthcoming data. Although we do not directly deal with 
observable-mass relations, we explore the dependence of our 
results on assumptions about the observable-mass relation. 

The motivations to study the uncertainty in predicting halo 
abundances and clustering are twofold. First, forthcoming 
optical surveys, such as the Dark Energy Survey (DESjjj 
the Panoramic Survey Telescope & Rapid Response Sys- 
tem (PanSTARRSfl, and the Large Synoptic Survey Tele- 
scope (LSSTfl X -ray surveys, such as the extended Roent- 
gen Survey with an Imaging Telescope Array (eRositafl 
and Sunyaev-Zeldovich (SZ) surveys, such as the Atacama 
Cosmology Telescope (ACTj3 and the South Pole Telescope 
(SPTfi all promise to expand greatly upon contemporary 
observations of clusters. These surveys will enable signif- 
icant reductions in statistical errors and provide better con- 
straints on the cluster observable-mass distribution. With 
ever-improving theoretical understanding of this distribution, 
systematic errors that are currently unimportant may be dam- 
agin g to future su r veys a nd need to be controlled. For exam- 
ple, ICrocce et al.l (1201 Oh demonstrated that current errors in 
predicted mass functions may lead to statistically significant 
systematic errors in the inferred dark energy equation of state 
from SZ surveys. 

Second, there is a significant, ongoing effort to develop 
ever more accurate predictions for the halo mass function 
(e.g. ISheth & Tormenl[l999l ISheth et al.ll200lt [Jenkins et al ' 
120011: lEvrard et all 12002b iReed et all l2003b IWarren et al 
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2008 ; iRobertson et al.l 120091: ILukic et al 



20091 ICrocce et al 



20 1C ) and halo bias (e.g.. ISheth & Tormenll 19991 ISheth etal 



ISeljak & Warrenll2004t ITinker et al.ll2008Ll2010l) . Mass 
functions and halo biases have been d erived on t heoret- 
ical grounds {Press & Schechteri [19741; iBond et al.l 
Mo & Whitdl 19961 ISheth & Tormenlll999l ISheth et al ' 



Mag giore & Riottol 120101 |2009albl see IZentnerl 12007 



1991 



2001 



C 



rocce et ; 



for a 

recent review). Fits to numerical simulations with theoreti- 
cally motivated functional forms have been provided by sev- 
eral groups, an d the curren t state - of-the-art includes th e re- 
rs b v ILukic et al.l (120071) . ITinker et al.l (120081) . and 
all (120101) . as well as the forthcoming work of the 
Los Alamos National Laboratory group (S. Bhattacharya et 
al. 2010, in preparation). Current theoretical predictions of 
the mass function are accurate at the ~ 10% — 30% level (e.g. , 
Tinker et alJl2008t IRobertson etal! 120091: iRudd et al.l 120081 
Stanek et al.l 120091) . and future efforts will continue to im- 
prove this accuracy. These efforts require numerous cosmo- 
logical numerical simulations along with requisite verifica- 
tion, validation, and considerable analysis, involving large 
commitments of both computational and human resources. 
Therefore, establishing the accuracy in the mass function and 
halo bias that are required by surveys is a particularly timely 
issue. 

In this work, we quantify how accurately halo mass func- 
tions and biases need to be predicted in order to have a negli- 
gible contribution to the error budgets of forthcoming cluster 
surveys. We begin in Section [2] with a simple demonstration 
of the potential errors that may be induced by the inaccuracy 
in the mass function. We present our mass function parame- 
terizati ons a nd cosmological parameter constraint forecasts in 
Se ctionl3.1land Sectio n l3.2l respectively. In all cases, we take 
the lTinker etal] (120081) mass function and lTinker elatl (120101) 
halo bias as a fiducial model, about which we perturb to esti- 
mate errors relevant to forthcoming optical and SZ surveys. 

We detail our results in Section |4] In summary, we find 
that for relatively near-term optical surveys like DES, or SZ 
surveys like SPT, the mass function must be calibrated to the 
w 1% level while the halo bias must be calibrated to the ss 5% 
level. The requirement on the mass function is relatively in- 
sensitive to different assumptions about the observable-mass 
distributions within the range considered in contemporary 
work, whereas the requirement on the halo bias is more sen- 
sitive. For longer-term experiments, such as a nearly half-sky 
optical survey planned for LSST, the calibration must be im- 
proved to the rj 0.5% level in the mass function and the rj 1 % 
level in the halo bias. This represents the most stringent re- 
quirement for the surveys being planned over the next decade 
and may serve as a practical endgame in the need to refine 
theoretical predictions for halo abundances and clustering. 

We also explore the halo masses and redshifts at which it is 
most important to make accurate predictions for the halo mass 
function. We find that the most effective strategy to improve 
dark energy constraints is to improve predictions at the low 
masses and low redshifts involved in the survey. Other details 
of the dependence of calibration requirements on mass and 
redshift depend upon the observable-mass distribution. We 
summarize our results and draw our conclusions in Section[5] 

As w e were completing our study, we learned of the related 
work of|Qinha_&Evrard'(2009). Although our results gener- 
ally agree with the results in this study, our study has a number 
of distinct and complementary aspects. These authors con- 
centrated on the specific parameters in the lTinker et al.l ( 120081) 
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FIG. 1. — Potential systematic errors in dark energy parameter estimates 
caused by inaccurate modeling of the mass function. The systematic errors 
are shown in units of the statistical u ncertainties expecte d from each survey. 
We assume t hat the fitting formula in [Tinker et al. (2008) describes the data, 
while that in Sheth & Tormen ( 1999) is used in the likelihood function. The 
effect on imminent cluster surveys is not negligible, especially for a wide-area 
survey like LSST. Moreover, this systematic error is representative of current 
limitations of JV-body methods, while baryonic processes a re known to cause 
shifts in halo mass functions of equal or greater magnitude i Rudd et al.l2008t 
IStanek et all2009D , 



mass function and the ISheth & Tormenl ( I 19991) halo bias, fo- 
cusing considerable study on the degeneracies between nui- 
sance parameters and cosmological parameters. In contrast, 
we aim to describe the uncertainty in mass function and halo 
bias in a manner that is independent of the form of any spe- 
cific fitting function. We discretize the mass function and 
halo bias, and assign to each mass and redshift bin a distinct 
nuisance parameter specifying the fractional deviation of the 
mass function or halo bias from the fiducial values. This pa- 
rameterization also allows us to identify the masses and red- 
shifts at which it is most fruitful to refine predictions in order 
to limit systematic errors on dark energy parameters. 

2. THE EFFECT OF INACCURACY IN THE HALO MASS FUNCTION 

We begin with an example showing the influence of inaccu- 
rate modeling of the halo mass function on the cosmological 
parameters derived from galaxy cluster surveys. We compare 
two commonly u sed fitting functions der ived f rom jV-body 
simula tions, the ISheth & Tormenl (119991) and ITinker et al] 
(120081) mass fu nctions. To constru ct a concrete example, we 
assume that the lTinker et al.1 (120081) function describes t he true 
halo mass function, while the ISheth & Tormenl (|T999j) func- 
tion serves as an example of an imperfect model in our like- 
lihood analysis. The differences between these functions ar e 
~ 20% over the most relevant masses (see lTinker et"ai1l2008l) . 
This represents a reasonable assessment o f our current ability 
to p redict the mass function . For example. ITinker et al.l (120081) 
and lRobertson et all ( 120091) discussed fundamental limitations 
in the ability of contemporary methods to construct analytical 
mass function fits at the ~ 10% level, and baryonic processes 
can give rise to additional deviations in predicted mass func- 
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tions as large as ~ 30% dRudd et al.l2008UStanek et al.l2009t) . 

We use the Fisher matrix formalism for parameter con- 
straint forecasting to compute the systematic error in dark 
energy parameters due to a mass function that is imperfectly 
calibrated. We use the term "systematic error" to describe 
the offset between inferred values of cosmological parame- 
ters and the true values of these parameters. This systematic 
error is often referred to as a "bias" in the literature, but we 
do not use this term in order to avoid confus ion with the halo 
bias. We refer the reader to the appendix of lWu et ail d2008l) 
for the derivation of the systematic errors relevant for cluster 
surveys and defer further discussion of our methods to the fol- 
lowing sections. We describe dark energy by a two-parameter 
equation of state w(a) = wo + (l -a)w a and take wq and w a as 
free parameters. 

Figure Q] shows the systematic errors in dark energy param- 
eters in terms of the statistical uncertainties. We present the 
absolute value of the error, and zero indicates that there is 
no systematic error. Three survey assumptions are shown: 
a DES-like optical survey (5000deg 2 to a maximum red- 
shift z max = 1), an LSST-like optical survey (20,000deg 2 to 
Zmax = 1), and an SPT-like SZ survey (2000deg 2 to z max = 1 .5). 
As can be seen, the dark energy equation of state parameters 
may have systematic errors at levels of approximately 20%- 
130% of their statistical uncertainties. These errors indicate 
that current inaccuracy in the halo mass function would not 
be a negligible contribution to the errors in dark energy pa- 
rameters inferred from the next generation of surveys. In fact, 
this is likely a conservative estimate of the systematic error 
because it does not include additional uncertainties induced 
by baryonic physics. 

3. METHODS 

3.1. Parameterizing Uncertainty in the Halo Mass Function 
and Halo Bias 

In this subsection, we describe our parameterization of an 
uncertain halo mass function dn/dlnM and an uncertain halo 
bias b{M), aiming to decouple our results from the particulars 
of published fits to halo mass functions and biases. For the 
mass function, we define a set of nuisance parameters fj as 
follows. We take the ratio of the actual mass function to a 
fiducial mass function at mass M and redshift z to be 



dn/dlnM 
(dn/d\nM)n& 



= fMM,z) 



(1) 



where the ipj(M,z) describes the binning in mass and redshift 
and equals unity when (M,z) is within the range specified for 
bin i. The index i runs over all bins in both the mass and 
redshift dimensions. 

As we will discuss in more detail in Section 13.21 galaxy 
cluster surveys observe halo mass proxies above some ob- 
servable threshold. In what follows, we model the theoretical 
mass function down to a minimum mass that is lower than the 
observable threshold by an amount comparable to the scatter 
in the observable-mass distribution (ctidm, which will be de- 
fined later). This low minimum mass is necessary to account 
for the scatter, which may lead relatively low-mass clusters 
with higher-than-average mass proxies to be included in an 
observed cluster sample. 

Between the minimum mass and a maximum mass, we use 
several bins for the theoretical mass function. We choose the 
width of the bins to be roughly comparable to the scatter in 



the relation between observable and mass. This scheme fol- 
lows from two considerations. First, the binning in the theo- 
retical mass function should be sufficiently fine so that con- 
straints on dark energy are maximally degraded when there is 
no prior information on any of the /, nuisance parameters. In 
this way correlated shifts in the mass function over a range 
of masses do not significantly aid in self-calibration. Sec- 
ond, very fine mass bins are unnecessary because the scatter 
in the observable-mass relation gives an effective resolution 
with which the mass function may be probed. Binning signif- 
icantly more finely means that several theoretical mass bins 
contribute to a measurement at a particular observable mass 
proxy. The value of the mass function in each bin will then 
need to be predicted with less accuracy, but the combination 
that results in the measurement will need to be predicted with 
fixed accuracy. 

Following from these considerations, it is clear that mass 
function accuracy requirements should depend upon bin size. 
We quote results that are realized when the mass function is 
binned at a resolution comparable to the scatter in the relation 
between observable and mass. This seems sensible in that it 
allows for maximal degradation of cosmological constraints 
when the mass function is uncertain, but does not sample the 
mass function significantly more finely than observations. 

For our model optical surveys, this choice amounts to five 
bins spaced evenly in InM between M m ; n = 10 133 h~ l M Q and 
M mdx = 10 153 /2 _1 Mq. For our model SZ surveys, we use 
seven mass bins above M m i„ = 10 13 9 h~ l MQ. Note that these 
binning methods for InM include one bin below the threshold 
of observable mass proxies discussed in the next section. We 
bin in redshift intervals of Az = 0.1 from z = to a maximum 
survey redshift, z max . For our fiducial mass function, we use 
the mass function fi t at a spherical overdensity of A = 200 in 
iTinker et alJ(l2008h . 

We introduce an analogous set of parameters gj to describe 
the uncertainty in halo bias around the fiducial value from 
ITinker et al] ( 120101) . 



b(M,z) 
b Fl d(M,z) 



■■gfl/>i(M,z) , 



(2) 



according to a binning scheme identical to that of the mass 
function. The halo bias and mass f unction may well be linked 
based on physical considerat i ons {K aiser 19 84t iMo & White! 
[I996t iSheth & TornienlllOOl lZentnerj|2007T ejL through the 
excursion set approach), but we treat them independently as 
a conservative way to account for the inadequacy of the as- 
sumptions of particular models. 

In our statistical forecasts, we include the parameters f and 
gi at each of the set mass and redshift bins. We then determine 
how well we must be able to set priors on the parameters /, 
and gi. We refer to the prior knowledge of the /, and gi as <jf. 
and <t s respectively. 

3.2. Implementation 

We forecast constraints on cos mological parameters u sing 
the methods described in detail in lWu et al.l d2008ll2009b . In 
what follows, we give a brief description of the aspects of this 
calculation necessary to quantify the effect of uncertainty in 
the mass function and halo bias and refer the reader to the 
aforementioned papers for a more complete description. 

We assume that a survey determines a particular mass proxy 
M bs and that the survey clusters are binned in lnM ODS accord- 
ing to a binning function </>(lnM ODS ). We assume that mass 
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proxies are related to mass through an observable-mass dis- 
tribution P(lnM D bs | InM) which specifies the probability that 
a halo with natural logarithm of mass InM gives a mass proxy 
lnM bs. As an example, consider a survey composed of a 
number of cells of volume V, each of which spans a narrow 
range in redshift. The mean number of clusters in each cell is 



where 



U«/,0,<M,z)(-^)j 



m = V / d\nM{ — — 
\d\nM j 

-^—\ (01 InM) (3) 



(0| InM) = J dlnM ohs P(lnM ohs \lnM)(j)(lnM obs ). (4) 

and the index k runs over all mass bins in the theoretical mass 
function at the corresponding redshift. 

We appl y counts-in-c ells and self-calibration as in 
iLima & Hul (12004 120051) . calculating both the expected 
counts in different bins, m, and the variance in counts, S = 
((m-m^m-m)). The Fisher matrix for a cluster survey is 
given by 

F Q/3 = m r Q C- 1 m /3 + iTr[C- 1 S !Q C- 1 S, /3 ] , (5) 

where C = diag(m) + S is the total covariance matrix. The 
comma followed by a Greek letter subscript refers to the 
derivatives with respect to model parameters, which include 
multiple /, and g, in this analysis. With this notation, the 
derivative of the mean cluster count in a particular cell with 
respect to f\ reads 

Qfh f ( dn \ 

M, ti =-=V J dlnMUM){—)j^M). (6) 

Analogous expressions hold for the variance in counts. 

We assume an observable-mass distribution 
P(lnM b s | InM) that is a Gaussian with a mean 
(lnM obs |lnM) = lnM + lnM b i as and a variance cr 2 nM . We 
parameterize the mean and variance of our observable-mass 
distribution as in ILima & Hul ( 120051) . with an additional mass 
dependence: 

lnM bias = lnM + a M ln(M/M pivot ) + a, ln( 1 + z) 

°fn m = CTRd + Pm ln(M/M pivot ) + B + B , z + B 2 z 2 + B 3 z 3 (7) 

Here M p i vot is the pivot mass characterizing the mass depen- 
dence. We fix its value to the observable threshold of the sur- 
vey and note that the uncertainty in M p i vot can be accounted 
for with Bq and lnMy so that this choice is innocuous. The 
parameter a\ a M is referred to as the observable-mass scatter 
throughout this work. This model includes eight nuisance pa- 
rameters in total: (InMo, a M , a z , (3 M , Bq, B\, B 2 , #3); their 
fiducial values are all assumed to be zero. 

We specify different surveys by an observable threshold 
M t h, a characteristic choice of observable-mass distribution, 
a maximum survey redshift z max , and a survey area A. For 
each survey assumption, we choose a bin size that is nar- 
row enough to ensure no loss of information due to binning, 
and we test the convergence of parameter constraints with bin 
size. We consider two broad classes of survey that differ in 
the method by which clusters are identified. First we consider 
typical "optical cluster survey" parameters with a relatively 



low observable threshold and a relatively broad dispersion in 
the observable-mass distribution. Second, we consider typi- 
cal "SZ cluster survey" parameters with a relatively high ob- 
servable threshold but a comparably small observable-mass 
dispersion. The details of these classes are as follows. 

1. Optically selected clusters: a-pa = 0.4, M t h = 
10 13 7 h~ l M Q , 8 bins in M obs with width Alog 10 M obs = 
0.2, andz max = 1. 

2. SZ-selected clusters: er Fid = 0.2, M th = 10 141 /r'M Q , 12 
bins in M b s with width Alog 10 M o b s = 0.1, and z max = 
1.5. 

We specifically consider a "DES-like" survey of optically se- 
lected clusters over 5000 deg 2 of sky and an "SPT-like" survey 
of SZ-selected clusters over 2000 deg 2 . We will discuss re- 
sults relevant for X-ray surveys, but we do not present distinct 
calculations for the X-ray case. As we will describe in the 
next section, we find that X-ray surveys drive requirements 
similar to or less stringent than those of SZ surveys. 

We characterize the statistical power of galaxy cluster sur- 
veys using the figure of merit (FoM) p roposed in the Repor t 
of the Dark Energy Task Force (DETF: lAlbrecht et al.ll2006h : 



FoM = 1/Vdet Co v(w ,w fl ) = [a{w a )a{w p )Y l , (8) 

where w = wq + (1 - a)w a , and w p is calculated at the pivot 
redshift at which w is best constrained. We assume a fidu- 
cial c osmology given by the WMAP5 best-fit dKomatsu et alj 
120091) with parameters: wo =-l, w a = 0, f2 DE = 0.726, £\ = 
0, n m h 2 = 0.136, fl b h 2 = 0.0227, n s = 0.960, A c = 4.54 x 
10~ 5 at k = 0.05Mpc _1 . We use a Planck prior Fisher matrix 
provided by Z. Ma and W. Hu (2008, private communication). 
This cosmic microwave background (CMB) information is in- 
cluded in all of our calculations. 

In what follows, we quote our results relative to the FoMs 
that may be attained in the limit of perfect predictions for the 
mass and bias functions of halos. For each survey, the FoM in 
this limit can span a range between a fixed observable-mass 
relation and an observable-mass relation in which all eight 
nuisance parameters must be self-calibrated. For a DES-like 
survey, these baseline figures of merit are FoM =17.1 (self- 
calibrated observable-mass relation) and FoM =139 (fixed 
observable-mass relation). In the case of an SPT-like sur- 
vey, the baseline figures of merit range from FoM = 5.7 (self- 
calibrated observable-mass relation) to FoM = 42.4 (fixed 
observable-mass relation). 

4. RESULTS 

4.1. General Mass Function and Halo Bias Requirements 

We begin by calculating the degradation in dark energy con- 
straints due to uncertainty in the mass function and halo bias. 
Figure [2] shows the ratio of the degraded FoM with respect 
to the fiducial FoM with perfectly known mass function and 
halo bias. We assume that all of the /, (see Equation (Q])) have 
the same prior, Cf, and all of the g, (see Equation (f2]i) have 
the same prior, a g . The left panels correspond to a DES-like 
survey, and the right panels correspond to an SPT-like survey. 

The top panels of Figure [2] assume no prior constraints on 
the observable-mass distribution. For a DES-like survey, less 
than 10% degradation in the FoM requires about percent-level 
precision on the mass function and halo bias. Note that the 
FoM is only fractionally degraded when the mass function 
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FIG. 2. — Degradation of the FoM due to uncertainties in mass function and halo bias. The contours and numbers correspond to the degraded FoM with respect 
to the FoM with perfectly-known mass function and halo bias. In order to quote a relatively simple result, we assume that the mass function parameters all have 
the same prior, at, and that the halo bias parameters all have the same prior, <r g . Given the statistical power of DES- and SPT-like surveys, the mass function needs 
to be predicted with a few percent precision to avoid 10% degradation in the FoM. For a perfectly known observable-mass distribution, the required precision of 
halo bias is less stringent because the information from sample variance becomes less important. Comparing a DES-like survey and an SPT-like survey, the latter 
has less stringent requirements because its smaller sky coverage and higher observable threshold result in fewer observed clusters. 



uncertainties increase by an order of magnitude. For example, 
when the uncertainty is as high as 10%, the FoM degradation 
is approximately 70%. On the other hand, for an SPT-like 
survey, the FoM is less sensitive to mass function and halo 
bias uncertainties. This is because an SPT-like survey has a 
higher observable threshold and smaller sky coverage so that 
the expected cluster counts are lower and the statistical errors 
are larger. 



The bottom panels of Figure [2] assume that the observable- 
mass distribution is perfectly constrained. Under this assump- 
tion, the requirement for mass function predictions is slightly 
less stringent than the case of a free observable-mass distribu- 
tion shown in the top panels. However, the requirement for the 
halo bias predictions is significantly less strict. This behavior 
stems from including the information from sample variance 
for self-calibrating the observable-mass distribution. When 
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FIG. 3. — Effect of survey area on the required accuracy of the mass func- 
tion and halo bias. We show the required precision in /, and g, that can avoid 
10% degradation in the FoM. The required precision is inversely proportional 
to square root of the survey area. In the context of our models, optical sur- 
veys require higher accuracy than SZ surveys because they yield larger cluster 
counts and thus have smaller statistical errors. We find that this result is fairly 
insensitive to the maximum depth of the survey and the assumptions of the 
observable-mass distribution. We mark as a star the estimated lower limit of 
the required <7f for a mass proxy with 5% scatter, which is relevant to X-ray 
surveys. 

the observable-mass distribution is uncertain, both the mass 
function and halo bias need to be well known to avoid degrad- 
ing the power of self-calibration. On the other hand, if the 
observable-mass distribution is well known, the requirement 
on the halo bias is markedly less stringent because the vari- 
ance in counts is no longer needed to calibrate this relation. 
Nevertheless, the small statistical errors on dark energy in the 
case of a well-known observable-mass distribution mean that 
stringent mass function predictions are still necessary. 

We note that although the top panels and the bottom pan- 
els show similar fractional degradations in the FoM, they 
have very different FoM values and thus very different abso- 
lute degradations. When the observable-mass distribution is 
well known, uncertainties in the mass function and halo bias 
will become the obstacle to achieve precision cosmology. On 
the other hand, when the observable-mass distribution is un- 
known, constraining the observable-mass distribution will be 
more effective to improve the FoM than constraining the mass 
function and halo bias. For detailed comparisons of uncertain- 
ties in the observable-mass distribution and those in the mass 
fu nction and halo b i as, we refer the reader to Figures 2 and 3 
in lCunha & Evrardl(l2009l) . 

It is worth reiterating that we have treated the mass func- 
tion and halo bias as completely independent, despite physi- 
cal considera tions by which these quantities should be linked. 
For example, iManera et al.l (120101) have shown that the peak- 
background split appro ach dKaiserll 19841 iMo & Whitell 19961: 
ISheth & Tormenll2002l) can be used to predict halo biases to 
~ 5% given a halo mass function. Comparing this to the re- 
sults of Figure 12 an SPT-like survey can tolerate a ~ 5% un- 
certainty in the halo bias when the mass function is known, 
for all practical purposes, perfectly. This indicates that it is 
not necessary to consider halo bias as an independent set of 
nuisance parameters in this case. Even the requirements of 
DES are not very far from this ~ 5% level of precision, sug- 
gesting that large degradations due to an uncertain halo bias 
when the mass function is well known are unlikely to happen. 

In Figure[3] we show the dependence of calibration require- 
ments on survey area. To be specific, we calculate the required 
<7f and <7 g that correspond to a 10% degradation in the FoM at 



each value of survey area A. In cases where we compute the 
required <Xf, we assume perfect knowledge of the halo bias; 
we assume the converse when computing the required a g . For 
the results shown in Figure [3] we have assumed that there are 
no priors on the nuisance parameters of the observable-mass 
distribution. 

The required <7f depends upon the survey area A roughly 
as <7f oc 1 / s/A for the following reason. In the Fisher matrix, 
the information from the data scales as A, and the information 
from priors scales as erf 2 ; therefore, the scaling of <7f is ex- 
pected from requiring comparable information from both data 
and priors. The same reasoning applies for a % . When compar- 
ing optical and SZ surveys, we can see that SZ surveys require 
less accuracy in the mass function and halo bias at fixed sky 
coverage because of the higher observable thresholds. 

We find this result to be relatively insensitive to the max- 
imum survey redshift once z max > 0.6. Above this redshift, 
when we include higher-redshift bins, the increase of dark en- 
ergy information is not as rapid as in the case of low redshift. 
In addition, when redshift increases, we introduce more nui- 
sance parameters, and these parameters need no higher preci- 
sion than those at low redshift. We also find that the results of 
Figure [3] depend only weakly on the scatter and the assumed 
priors for the observable-mass relation over a wide range of 
reasonable values. 

We do not explicitly explore the requirement for an X-ray 
cluster survey because it can be estimated from our SZ results 
by considering the scatter and observable thres hold. Since 
X-ray clusters tend to have smaller scatter (e.g. iMantz et al.l 
12009a!) than SZ clusters, we calculate the case of a full-sky 
survey with a scatter 0.05 and a constant observable thresh- 
old Mth = 10 14 1 Ii~ 1 Mq. The requirement on f t is about 0.5%, 
which is very close to the SZ result. We mark this estimate 
as a star in Figure [3] On the other hand, X-ray clusters tend 
to have a higher observable threshold than SZ clusters at high 
redshift. As we have shown, higher observable threshold re- 
quires less stringent constraints. Therefore, we claim that the 
requirement for a full-sky X-ray cluster survey will be less 
stringent than 0.5%. 

4.2. Comparing Bins 

Next, we address the relative importance of the accuracy in 
the predictions for the halo mass function at different masses 
and redshifts. There are a number of methods that could 
be used to describe the relative importance of different halo 
masses and redshifts. We quantify the relative importance as 
follows. We begin with a very nonrestrictive prior on the mass 
function at all masses and redshifts. We then choose a single 
fi and tighten the prior on this individual parameter. Subse- 
quently, we compute the percentage improvement in the dark 
energy FoM with the more restrictive prior on the mass func- 
tion in the single bin. We repeat this procedure for all mass 
and redshift bins and display the relative FoM improvements 
in Figure |U In each panel, the horizontal axis represents red- 
shift bins, and the vertical axis represents mass bins. For a 
DES-like optical survey (left panels), we start with a 10% 
prior on fj and improve it to 1%; for an SPT-like SZ survey 
(right panels), we start with a 30% prior and improve to 3%. 

The relative importance of different bins does depend upon 
the assumptions of the observable-mass distribution. We 
show the results for three different cases in Figure [4] In the 
top panels, we assume a fixed observable-mass relation. In 
the middle panels, we assume an observable-mass relation in 
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FIG. 4. — Relative importance of /, in different mass (vertical axes) and redshift (horizontal axes) bins for a DES-like survey (left) and an SPT-like survey 
(right). The number in each bin reflects the percentage improvement in the FoM that results from tightening the prior on the mass function in that bin (from 10% 
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which only InMo and Bo of Equation © are free to vary. In 
this case, the observable-mass distribution has a free mean 
and a free scatter, but these have no mass or redshift depen- 
dence. Lastly, in the bottom panels, all eight parameters in 
Equation (|7]i are free to vary so that the observable-mass re- 
lation may have significant mass and redshift dependence. 

One common trend for these different assumptions on the 
mass-observable relation is that the low-mass bins tend to be 
more important than the high-mass bins when calibrating the 
mass function. Because of the steepness of the mass func- 
tion, the low-mass bins contain more statistical power than 
the high-mass bins. In addition, because of the scatter in the 
observable-mass distribution, accurate predictions at the low 
mass end are needed to account correctly for a potentially 
large number of high-M D b s clusters that up-scatter from rel- 
atively low true masses. 

In the case of the fixed observable-mass relation, the pat- 
tern in redshift is driven by both the assumed CMB prior on 
cosmological parameters and the cluster counts. With the 
CMB prior, lowest-redshift bins tend to provide the most com- 
plementary information. Consequently, the low-redshift bins 



are highlighted as particularly important to the calibration 
of the theoretical predictions. On the other hand, the low- 
redshift bins have small counts due to the small volume (in 
our DES-like survey, the peak in the number of clusters per 
redshift interval occurs near z ~ 0.7); therefore, some of the 
higher-redshift bins also have a strong influence on the dark 
energy constraints. 

Comparing a DES-like survey and an SPT-like survey, we 
find that the lowest-redshift bins above the observable thresh- 
old (shown as a horizontal thick line) are highlighted in both 
cases. On the other hand, in the case of DES, the mass bins 
below the observable threshold are also highlighted because 
of the large scatter. These bins correspond to the greatest halo 
counts, and the up-scattering of halos from these bins will be 
a significant fraction of the cluster sample. Therefore, in the 
presence of a large scatter, the accuracy of the mass function 
below the observable threshold will impact the dark energy 
constraints. We also find that this pattern depends slightly 
on the binning in M Q b s ; when the observational bin is very 
wide, the importance of the mass function below the observ- 
able threshold is less significant because the up-scattering of 
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halos near the threshold is not well resolved. However, as 
we have already discussed, it is most beneficial to bin finely 
in observations if the observable-mass scatter can be under- 
stood even in a parameterized manner, because this results in 
maximum cosmological information. 

The other two rows shown in Figure fallow for additional 
parameter freedom in the observable-mass relation that must 
be calibrated from the survey data itself. For each survey as- 
sumption, the structure in the two lower panels of Figure |4] is 
more complex than in the top panel. One general difference is 
that the low-redshift bins are relatively more important when 
the observable-mass relation must be calibrated, and this oc- 
curs primarily for two reasons. First, the low-redshift bins 
provide complementary information to the CMB prior. Sec- 
ond, the degeneracy between the mass function and the scatter 
is stronger at low redshift because the mass function is a shal- 
lower function at low redshift. 

As a result, it is most important to set stringent priors on 
the mass function at low redshift and low mass. The rela- 
tive importance of low-mass bins depends on the scatter of 
the observable-mass distribution. In the most general case of 
a mass- and redshift-dependent observable-mass distribution, 
new parameter degeneracies emerge but the general pattern is 
altered only slightly. 

5. CONCLUSIONS 

We have studied the impact of theoretical uncertainties in 
predictions of the halo mass function and halo bias on dark 
energy constraints from upcoming cluster surveys. Our pri- 
mary conclusions are as follows. 

1 . Inaccuracy in the shape of the mass function at the 20% 
level (similar to the current state of theoretical uncer- 
tainty) can lead to significant systematic errors in dark 
energy parameter inferences. 

2. For near-term cluster surveys like DES, the mass func- 
tion must be predicted at approximately 1% accuracy 
to avoid more than 10% degradation in the resulting 
dark energy FoM. Similarly, the halo bias should be 
predicted with a precision of approximately 5%. The 
current state of uncertainty in these functions could de- 
crease the FoM of a DES -like survey by a factor of ~ 2. 
Requirements are generally less restrictive for SZ and 
X-ray efforts. 

3. A future optical survey over a significant fraction of 
the sky, like LSST, will require the most stringent con- 
straints on the theoretical predictions. In this case, the 
mass function and halo bias must be computed with an 
accuracy of ~ 0.5% and ~ 1% respectively in order to 
guarantee that the theoretical uncertainty in these quan- 
tities is a negligible contributor to the dark energy error 
budget. This represents a practical limit to the accuracy 
with which these quantities will need to be computed in 
order to interpret future survey data. 

4. Precise prediction of the mass function at the low 
masses close to the observable threshold and low red- 
shifts that will be used in the analysis of survey data is 
the most beneficial to improve dark energy constraints. 

We have considered the influence of theoretical uncertain- 
ties in comparison only to statistical errors on forthcoming 
measurements; however, additional systematic errors that will 



be present at different levels in different types of cluster sur- 
veys will make the demands on theoretical mass functions 
somewhat less restrictive. For example, the systematic errors 
due to inaccurate modeling of t he observable-mass distribu- 
tion ( e.g. non-Gaussian scatter. ICohn et al.ll2007t IShaw et al.l 
120091) . an inaccurate calibration of completeness or false de- 
tection rate, and large errors in photometric redshift estimates 
will all increase errors on dark energy parameters and reduce 
the relative influence of uncertainties in predicted halo mass 
functions and biases. The requirements we advocate are rel- 
evant when these other known sources of error do not domi- 
nate the error budget. A robust interpretation of our calcula- 
tions is that the requirements we quote render the error due 
to inaccurate predictions of halo abundances unimportant, but 
less stringent requirements may be adequate when systematic 
errors are large. However, we did show that the theoreti- 
cal requirement is relatively insensitive to knowledge of the 
observable-mass relation when this relation is well described 
by a Gaussian distribution. This suggests that the require- 
ments may not be considerably loosened if systematics are 
moderate. 

Dark matter simulations currently have ^5% uncertain- 
ties in predicting mass functions for fairly standard cos- 
mological models, even when the dark ene rgy is fixed to 
a cosmological constant dTinker et al.l 12008b . To achieve 
sub-percent level predictions in dark matter simulations 
requires a m ore careful considera tion of systematics in 
halo finding ([H eitmann et al.l 120051) and initial conditions 
dCrocce et al.l 120061) . as well as a more c areful study of the 
non-universality of the halo mass function dTinker et alj2008t 
iRobertson et al.l 120091) . This will require simulations of a 
wider range of cosmological models that go beyond the stan- 
dard cold dark matter with cosmological constant model. 
Reasonab ly exhaustive simulat i on programs are begin- 
ning (e.g. iDesiacques et al . 2009; Magg iore & Riottoll2009b ; 
| Lam & ShetM2009t lMartino et al.l2009t lPillepich et al.l | 2010 ; 
I Casarini et al.ll2009t iJennings et al.ll201Qt iGrossi et al.l 12009 ; 
lAlimi et al. feoiOl) . Even more challenging still will be un- 
derstanding the systematic uncertainties induced by bary- 
onic physics. Recent studies have shown that these effects 
can result in significant deviations in the halo number den- 
sity a nd that these effects are a function of the halo mass 
scale dRudd et alj|2008t IStanek et alj|2009l) . Although meet- 
ing the stringent constraints we advocate may be quite chal- 
lenging, pursuing further accuracy in theoretical predictions 
along with controlling various systematics in clusters surveys 
promises to continue to improve our knowledge of dark en- 
ergy. 
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